Learning DFA: evolution versus evidence driven state merging

نویسندگان

  • Simon M. Lucas
  • T. Jeff Reynolds
چکیده

Learning Deterministic Finite Automata (DFA) is a hard task that has been much studied within machine learning and evolutionary computation research. This paper presents a new method for evolving DFAs, where only the transition matrix is evolved, and the state labels are chosen to optimize the fit between final states and training set labels. This new procedure reduces the size and in particular, the complexity, of the search space. We present results on the Tomita languages, and also on a set of random DFA induction problems of varying target size and training set density. The Tomita set results show that we can learn the languages with far fewer fitness evaluations than previous evolutionary methods. On the random DFA task we compare our method with the Evidence Driven State Merging (EDSM) algorithm, which is one of the most powerful known DFA learning algorithms. We show that our method outperforms EDSM when the target DFA is small (less than 32 states) and the training set is sparse.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Results of the Abbadingo One DFA Learning Competition and a New Evidence-Driven State Merging Algorithm

This paper first describes the structure and results of the Abbadingo One DFA Learning Competition. The competition was designed to encourage work on algorithms that scale well—both to larger DFAs and to sparser training data. We then describe and discuss the winning algorithm of Rodney Price, which orders state merges according to the amount of evidence in their favor. A second winning algorit...

متن کامل

Mutually Compatible and Incompatible Merges for the Search of the Smallest Consistent DFA

State Merging algorithms, such as Rodney Price’s EDSM (Evidence-Driven State Merging) algorithm, have been reasonably successful at solving DFA-learning problems. EDSM, however, often does not converge to the target DFA and, in the case of sparse training data, does not converge at all. In this paper we argue that is partially due to the particular heuristic used in EDSM and also to the greedy ...

متن کامل

Evidence Driven State Merging with Search

During last year's Abbadingo competition, two people showed how to improve the average-case performance of state merging algorithms for DFA learning. Rodney Price obtained a big improvement by discovering a good order for performing merges. Hugues Juill e obtained a bigger improvement at greater cost by wrapping a stochastic search around a state merging algorithm. Here we establish a new high-...

متن کامل

Gradual probabilistic DFA learning with caching for conversational agents

SUMMARY This paper proposes a method of reducing the cost of gradually constructing task-oriented conversational agents with FSMs by collecting example dialogues. Probabilistic-DFA learning algorithms with the state merging method are available for the FSM-based dialogue model. However, these algorithms must learn the whole model again as often as the example data increase. We proposed a learni...

متن کامل

A Stochastic Search Approach to Grammar Induction

This paper describes a new sampling-based heuristic for tree search named SAGE and presents an analysis of its performance on the problem of grammar induction. This last work has been inspired by the Abbadingo DFA learning competition [14] which took place between Mars and November 1997. SAGE ended up as one of the two winners in that competition. The second winning algorithm, rst proposed by R...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003